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Abstract — The  channel  for  a  Comm-on-the-move  (COTM)  ter¬ 
minal  in  a  blockage  environment  communicating  over  a  satellite 
can  be  characterized  as  a  packet  erasure  channel  with  long 
channel  memory  and  long  feedback  delay.  The  goal  of  our 
research  is  to  enable  real-time  application,  such  as  a  two-way 
voice  call,  over  such  a  challenging  channel.  These  packets  need 
to  be  delivered  reliably  with  strict  order  requirement.  While 
various  ARQ  techniques  are  often  used  for  this  purpose,  they 
become  ineffective  when  the  channel  is  severely  blocked  with 
long  memory  and  long  feedback  delay,  and  the  user  desires  delay 
performance  that  is  only  a  small  amount  above  the  physical  limit 
of  the  channel  while  not  consuming  too  much  bandwidth. 

We  propose  a  simple  periodic  retransmit  solution  for  this 
problem.  For  example,  for  a  50%  blocked  channel  with  the  round 
trip  time  being  about  the  same  as  the  channel  memory  duration, 
we  can  achieve  average  packet  delay  that  is  less  than  one  channel 
memory  duration  over  the  minimum  possible,  while  being  more 
than  50%  efficient  with  the  bandwidth  consumed. 

For  this  simple  periodic  retransmit  scheme,  we  analyze  the 
expected  delay  of  a  packet,  and  compute  the  difference  from 
the  best  possible  delay.  We  evaluate  the  bandwidth  consumption 
and  show  a  delay-throughput  tradeoff  We  also  suggest  rules  for 
selecting  operating  points  that  balances  delay  and  throughput. 
Furthermore,  we  characterize  user  perception  metrics  of  expected 
duration  and  frequency  of  playback  interruptions.  Simulations 
are  used  to  validate  the  analysis. 

Index  Terms — real-time  communication;  communications-on- 
the-move  (COTM);  satellite;  packet-loss  channel;  erasure;  block¬ 
age;  channel  with  memory;  Markov  model;  delay;  throughput; 
link-layer  retransmission;  ARQ; 

I.  INTRODUCTION 

The  channel  for  a  Comm-on-the-move  (COTM)  terminal  in 
a  blockage  environment  communicating  over  a  satellite  can  be 
characterized  as  a  packet  erasure  channel  with  long  channel 
memory  and  long  channel  propagation  and  feedback  delay. 
The  goal  of  our  research  is  to  enable  real-time  streaming, 
such  as  video  or  voice,  over  such  a  challenging  channel.  The 
stream  may  be  a  long  stream,  such  as  a  surveillance  video 
for  situational  awareness,  or  short  bursts,  as  in  conversations. 
A  fundamental  limitation  on  the  delay  performance  is  due  to 
the  physical  channel  blockage,  i.e.,  packets  generated  during 
channel  blockage  cannot  get  through  while  the  channel  is 
blocked  and  must  wait  for  channel  to  open  up.  We  would 

This  work  was  sponsored  by  the  Department  of  Defense  under  Air  Force 
Contract  FA8721-05-C-0002.  Opinions,  interpretations,  conclusions,  and  rec¬ 
ommendations  are  those  of  the  authors  and  are  not  necessarily  endorsed  by 
the  United  States  Government 


like  to  have  a  retransmission  strategy  that  can  achieve  delay 
performance  that  is  only  a  small  amount  above  the  best 
possible  and  yet  not  consume  a  large  amount  of  bandwidth 
with  unnecessarily  frequent  retransmissions. 

Traditionally,  ARQ’ methods  have  been  used  for  reliable 
delivery  of  ordered  packets  over  unreliable  links.  However, 
known  ARQ  methods  may  not  work  well  for  severely-blocked 
long-memory  long-delay  channels  and  may  not  work  well  for 
packets  that  are  generated  and  consumed  sequentially  in  real 
time  with  low  desired  delay.  For  examples,  for  an  interactive 
voice  application  (one  packet  per  20  ms),  users  may  desire  a 
delay  performance  of  no  more  than  a  few  seconds,  even  when 
the  channel  delay  and  channel  blockage  memory  are  both  on 
the  order  of  a  second  and  the  channel  is  blocked  50%  of  the 
time. 

Basic  ARQ  techniques,  stop-and-wait  ARQ,  go-back-N 
ARQ,  and  selective-repeat  ARQ,  become  inefficient  as  the 
channel  loss  rate  becomes  higher  than  0.1%  [1].  Hybrid  ARQ 
schemes  that  utilize  forward  error  correction  (FEC)  coding 
were  designed  to  reduce  the  effective  channel  loss  rate  [l]-[3]. 
When  the  channel  degradation  is  mainly  due  to  noise,  FEC  can 
be  quite  effective.  However,  for  blockage  channels,  the  FEC 
would  have  to  span  many  (possibly  tens)  independent  channel 
realizations  to  achieve  the  same  effect.  When  the  channel 
memory  is  long,  the  coding  would  induce  a  long  delay  and 
a  high  complexity.  Without  coding,  the  ARQ  mechanism  still 
needs  to  deal  with  the  high  packet  loss  rate.  In  [4],  a  hybrid 
ARQ  technique  was  used  to  support  interactive  voice  over  a 
moderately  blocked  satellite  channel  as  a  COTM  terminal  was 
driven  in  a  city  environment,  and  the  user  experience  was  poor. 

Furthermore,  most  ARQ  schemes  focus  on  the  throughput 
achieved  by  the  system  rather  than  delay.  In  all  ARQ  schemes, 
including  the  hybrid  ARQ  schemes,  retransmissions  are  only 
performed  after  a  full  round  trip  time  (RTT).  When  the  channel 
loss  rate  is  high,  there  is  a  non-negligible  probability  that  it 
would  take  several  RTT  for  a  packet  to  get  through.  When 
ordered  packet  delivery  is  required,  one  such  packet  would 
delay  all  packets  that  come  after.  For  real-time  applications, 
it  is  desirable  to  achieve  delay  that  is  only  a  few  RTT  or 
even  less  than  one  RTT.  If  the  physical  channel  is  blocked  for 
multiple  seconds,  the  delay  should  be  just  a  little  bit  beyond 
that.  To  achieve  this  level  of  low  delay  over  a  severely-blocked 
long-memory  long-delay  channel,  a  different  kind  of  scheme 


is  needed. 

Our  prior  work  [5]— [7]  studied  this  problem  without  the 
channel  memory  consideration;  the  blockages  were  modeled  as 
independent  and  identically  distributed  (HD)  events.  A  multi¬ 
burst  transmission  scheme  was  proposed  where  a  number  of 
retransmissions  are  sent  over  independent  channel  realizations 
in  one  RTT,  and  the  number  increases  as  we  get  close  to 
a  desired  delay  target.  The  scheme  has  the  option  to  yield 
low  delay  very  close  to  the  physical  channel  limit  by  using 
a  large  number  of  retransmissions.  We  showed  that  there 
is  a  fundamental  tradeoff  between  the  delay  achieved  and 
bandwidth  consumption,  we  could  trade  delay  for  bandwidth 
save  and  vice  versa. 

This  work  addresses  the  scenario  with  long  channel  mem¬ 
ory,  which  could  last  for  many  seconds  [4].  In  this  case,  the 
channel  memory  is  longer  than  or  on  the  same  order  as  channel 
delay  and  the  duration  of  some  short  messages.  One  possible 
scenario  is  where  the  channel  memory,  channel  delay,  and 
stream  duration  are  all  on  the  order  of  one  to  a  few  seconds. 

In  this  paper,  we  propose  a  rather  simple  periodic  retransmit 
scheme  for  the  problem.  It  is  likely  not  the  best  that  can 
be  done,  but  we  will  show  that  it  has  reasonably  good 
performance  when  the  retransmission  interval,  5,  is  chosen 
appropriately.  Generally,  5  that  is  too  large  would  lead  to  delay 
much  larger  than  the  physical  limit,  and  a  5  that  is  too  small 
relative  to  the  channel  memory  would  gain  little  in  delay  and 
yet  incur  a  lot  of  bandwidth  consumption. 

This  paper  is  outlined  as  follows.  In  Section  II,  the  system 
model  is  given  including  the  model  for  the  channel  memory. 
In  Section  III,  we  analyze  the  expected  delay  of  a  packet 
and  compute  the  difference  to  the  best  delay  achievable  by 
any  scheme,  which  is  only  limited  by  the  physical  channel 
blockages.  In  Section  IV,  the  bandwidth  requirement  is  evalu¬ 
ated.  Section  V  shows  the  tradeoff  between  the  delay  achieved 
and  the  bandwidth  requirement,  while  Section  VI  suggest 
rules  for  selecting  operating  points  depending  on  the  channel 
blockage  probability  and  how  the  channel  delay  compares 
with  the  channel  memory  duration.  We  show  that  it  is  indeed 
possible  to  achieve  delay  that  is  slightly  above  the  physical 
limit  with  reasonable  bandwidth  consumption.  For  example, 
for  a  50%  blocked  channel  with  the  round  trip  time  being  about 
the  same  as  the  channel  memory  duration,  we  can  average 
achieve  packet  delay  that  is  less  than  one  channel  memory 
duration  over  the  minimum  possible,  while  being  more  than 
50%  efficient  with  the  bandwidth  consumed.  In  Section  VII, 
we  analyze  the  a  few  user  perception  metrics  such  as  expected 
duration  and  frequency  of  playback  interruptions.  Section  VIII 
presents  simulation  results  that  validate  the  analysis.  SectionIX 
provides  some  conclusing  remarks. 

II.  System  Model 

The  system  block  diagram  is  shown  in  Fig.  1  It  is  very 
similar  to  the  one  used  in  [5]-[7],  expect  modified  to  have  a 
continuous  source  and  a  channel  with  memory. 

A  source  generates  a  constant-rate  stream  of  duration  L 


from  time  0  to  time  L.  1  The  transmitter  transmits  on  integer 
multiples  of  a  time  step  <5,  which  is  the  one  and  only  design 
parameter.  Source  data  from  time  ( k  —  1)5  to  k8,  k  = 
1, 2, 3,  ■  ■  -  ,K,  where  K  =  \L/S\ ,  are  bundled  into  a  packet 
Pk-  Each  packet  pk  is  immediately  transmitted  at  time  k8,  and 
repeatedly  retransmitted  at  times  (k  +  1)5,  (k  +  2)5,  •  •  • ,  until 
an  acknowledgment  (ACK)  for  packet  pk  is  received.  These 
(re)transmissions  are  carried  out  for  all  packets  independently 
and  simultaneously.  For  example,  at  time  35,  if  pi  and  p2 
have  not  be  acknowledged,  then  pi,  p2,  and  p3  are  transmitted 
together. 

One  aspect  of  the  transmission  scheme  that  differs  from  tra¬ 
ditional  ARQ  scheme  is  the  dynamic  bandwidth  consumption, 
where  the  number  of  packets  sent  each  time  changes.  This 
is  possible  when  the  stream  of  interest  shares  resources  with 
other  lower  priority  traffic,  thus  able  to  use  more  resources 
when  needed.  This  feature  is  beneficial  as  it  allows  the 
transmitter  to  not  delay  transmitting  new  packets  when  an  old 
packet  needs  to  be  retransmitted.  It  also  allows  for  adapting  to 
channel  variations  in  real  time  -  when  the  channel  is  relatively 
more  open,  there  would  be  fewer  retransmissions. 

The  erasure  channel  with  channel  memory  is  modeled  using 
a  continuous-time  two-state  Markov  model,  as  shown  in  Fig.  x. 
Let  U  and  B  denote  the  channel  open(unblocked)  and  blocked 
states.  One  common  way  to  specify  a  continuous-time  two- 
state  Markov  model  is  using  the  average  duration  of  time  spent 
in  each  state,  i.e.,  Tjj  and  Te-  An  alternative  way  is  to  use 
the  channel  open  probability  p  and  channel  memory  duration 
To,  where 


p=^- 

^  Tu  +  Tb 


and  To  = 


TvTb 

Tu+Tb' 


The  steady  state  probability  of  being  in  each  state  is 


(1) 


Pu  =  p  and  Pb  =  1  -  p.  (2) 

Let  s(t)  denote  the  channel  state  at  time  f.  The  state  transition 
probabilities  across  time  r  >  0  are 

Puuir )  =  Pr{s(t  +  r)  =  l\s(t)  =  1}  =  p+  (1  -  p)e~T/T°, 

Pbb{t )  -  Pr{s(f  -I-  r)  =  0|s(t)  =  0}  =  I-  p  +  p  e_T/T° , 

Pub(t )  =  1  -  Puuir),  and  PBu(t)  =  1  -  Pbb{t).  (3) 

Any  transmission  made  at  time  t  reaches  the  receiver  after 
a  delay  of  dc  if  s(t)  =  1,  or  is  blocked  if  s(t)  =  0.  The 
transmitter  learns  the  channel  state  through  an  error-free  packet 
acknowledgment  fed  back  after  a  further  delay  of  dc.  The 
effective  round  trip  time  (RTT),  i.e.,  the  time  it  takes  for  a 
packet  loss  to  be  responded  by  a  new  transmission,  is  RTT  = 
(L2dc/5J  + 1)5. 

Even  though  the  channel  model  is  continuous-time,  since 
the  transmitter  only  transmits  on  integer  multiples  of  5,  the 
effective  channel  is  really  a  discrete-time  one.  However,  as 
the  time  step  5  is  a  variable,  it  is  necessarily  to  define  the 


'We  assume  the  source  symbol  or  packet  interval  is  sufficient  small,  so  the 
discrete  effects  are  negligible. 
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Channel  state  feedback  delay  2 dc 


Fig.  1.  Streaming  system  block  diagram. 


model  in  continuous-time  terms  to  keep  the  discrete  channel 
models  consistent  as  <5  changes. 

As  stated  earlier,  this  study  focus  on  the  case  where  the 
channel  memory  To  is  longer  or  on  the  same  order  as  dc.  The 
message  duration  L  could  be  longer  than  or  shorter  than  T0. 
In  the  sequel,  we  will  often  need  to  express  time  quantities 
relative  to  To,  such  as  in  (3).  For  notational  convenience,  we 
use  ~  to  represent_time  quantities  scaled  by  T0.  For  example, 
dr  —  dc/To  and  L  =  L/Tq ■ 

The  receiver  process  the  received  packets,  sorts  them,  and 
eliminates  duplicate  packets  received.  It  passes  the  packets 
sequentially  to  the  playback  buffer.  It  also  sends  acknowl¬ 
edgments  back  to  the  transmitter  over  the  error-free  feedback 
channel. 

At  the  playback  buffer,  it  simply  plays  back  the  received 
packets  once  per  time  step  8.  In  the  event  that  there  is  no 
packet  to  playback,  which  might  happen  after  a  long  blockage, 
there  would  be  an  interruption  in  playback.  When  the  channel 
opens  again,  there  would  typically  be  a  burst  of  packets 
received  simultaneously.  The  oldest  (smallest  index)  of  those 
packets  is  played  back  first,  all  later  packets  are  buffered  and 
played  back  sequentially.  Consequently,  after  each  packet  is 
received,  it  cannot  be  played  back  until  all  the  packets  that 
have  been  buffered  are  played  back.  Although  this  causes 
additional  delay,  one  benefit  is  that  a  short  blockage  after  a 
longer  blockage  had  occurred  does  not  cause  an  interruption 
in  playback,  as  there  are  buffered  packets  to  playback  to  help 
“ride  over”  the  short  blockage.  One  potential  way  to  reduce 
delay  is  to  allow  playback  to  happen  in  a  “catch  up”  mode, 
where  the  packets  are  played  back  slightly  faster  (e.g.,  10%) 
than  normal.  While  we  do  not  assume  such  capability  in  this 
paper,  it  could  be  useful  in  practice. 

III.  Delay  Analysis  -  From  a  Packet  Perspective 

We  use  a  similar  packet  delay  definition  as  in  [5]-[7].  In 
particular,  the  delay  experienced  by  packet  pk  is 

Dk  —  Mk  —  k5 ,  (4) 

where  Mk  denotes  the  time  pk  is  played  back  and  k8  is  the 
time  pk  is  generated.  For  example,  pi  is  generated  at  time  8 
and  immediately  transmitted.  If  s(8)  =  1  (channel  open),  then 
Pi  is  received  at  dc  +  8  and  playback  finishes  at  time  dc  4-  28, 
so  D\  =  dc  +  28  —  8  =  dc  +  6.  This  is  the  minimum  delay 
every  packet  must  experience. 

More  generally,  a  packet  Pk  is  generated  at  time  k8,  and  is 
played  back  at  time  dc+(l+k+Bk)5,  where  Bk  is  the  longest 


stretch  of  consecutive  blockage  experienced  by  any  packet  up 
to  and  include  p*.  This  was  shown  in  [5]  -[7],  The  intuition 
is  that  if  any  packet  pj,  1  <  j  <  k  experiences  Bk  blockage 
consecutively,  Pj  itself  suffers  a  delay  of  dc  +  (1  +  Bk)8.  All 
packets  after  it  must  suffer  at  least  this  delay. 

The  average  packet  delay  for  pk  can  be  calculated  using 

OO 

E[Dk\  =  dc  +  8  +  8  ]T  Pr{Bk  >  b}.  (5) 

6=1 

To  evaluate  Pr{Bk  >  6},  we  see  that  in  order  to  have  6 
consecutive  zeros,  the  channel  sequence  must  take  the  form 

p  0*  •  •  • 
b 

or 

* *10 ••  -O*---*,  for i  =*  1,2,* •*  ,k  1. 

i— 1  b 

The  probability  of  having  a  row  of  b  blockages  at  the 
beginning  is  PBPaa{8)b~1-  For  each  i  =  1,2, ,k  —  1, 
the  probability  of  having  a  particular  pattern  of  this  form  is 
PuPubPb~b  ■  Using  the  union  bound, 

Pr{Bk  >b}<  (6) 

min(l,  PBPBB{8)b -1  +  (k  -  1)  PuPUBP bB~Bl. 

Combine  (3),  (5)  and  (7),  the  expected  delay  of  an  entire 
stream  of  duration  L,  D(L)  =  DK~ \l/s\,  can  be  upper 
bounded  by 

OO 

E\D{L)\  <Tc  +  8  +  8J2 min  (1,(1 -,:>)•  (7) 

6=1 

(l  -  P  +  P  ■  e-*)"-1  (l  +  {\m  -  1)  p{  1  -  e-*))) 

The  right  hand  side  contains  the  channel  parameters,  p  and 
dc,  the  stream  duration  L,  and  the  time  step  8,  which  is  a 
design  parameter.  Fig.  2  shows  the  excess  delay  above  the 
channel  delay,  E[D(L)]  —  dc,  as  a  function  of  L  in  log  scale, 
for  p  -  0.5  and  8  =  0.01,  0.1,  0.5,  and  1.0. 

Looking  at  one  curve  at  a  time,  each  curve  starts  off  flat  for 
L  <  1.  This  is  because  when  the  stream  duration  is  shorter 
than  the  channel  memory,  the  entire  stream  would  essentially 
see  the  same  channel,  and  it  is  the  average  channel  blockage 
duration  that  dominates  the  delay.  Indeed,  the  starting  point 
is  approximately  +  ^8  for  8  <  1.  2  The  first  term 

2The  exact  starting  point  is  1-pe  6  ss  S2  H - 
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Fig.  2.  Packet  delay. 


is  essentially  PbTb,  the  probability  that  the  stream  sees  a 
blocked  channel  when  it  is  started  times  the  expected  blockage 
duration. 

For  large  L,  delay  grows  linearly  with  log(L),  similar  to  the 
conclusion  in  [5]— [7].  This  is  because  with  a  two-state  Markov 
model,  the  blockage  duration  is  exponentially  distributed,  so 
statistically,  it  would  take  an  exponentially  longer  stream  to 
hit  a  blockage  that  is  a  fixed  amount  longer.  The  asymptotic 
slope  of  each  curve  with  respect  to  ln(L)  is  approximately 
p  +  for  5  <  1.  3  4 

Compare  the  different  curves,  lower  delays  are  achieved  for 
smaller  values  of  5.  In  the  limit  of  5  — »  0,  the  transmitter 
essentially  tries  to  transmit  continuously  (unrealistic,  as  it 
would  consume  a  prohibitively  large  amount  of  bandwidth),  so 
every  packet  is  put  through  the  channel  as  soon  as  the  channel 
opens  after  the  source  generates  it,  and  thus  achieving  the 
minimum  delay  physically  possible.  The  bottom  curve  with 
5  =  0.01  essentially  corresponds  to  this  limit.  Compare  to 
this  bottom  curve,  the  delay  gradually  increases  as  S  increases. 
5  —  0.1  has  a  very  small  gap  to  the  minimum  delay.  5  -  0.5 
suffers  additional  delay  of  less  than  0.9  for  L  <  10;  even  at 
L  =  100,  the  additional  delay  is  less  than  1.4,  or  just  1.4% 
of  the  stream  length.  For  L  >  100,  the  gap  grows  at  a  rate  of 
0.65  per  decade.  8  =  1.0  suffers  about  twice  as  much  delay 
degradation  as  8  =  0.5.  The  steps  in  the  curves  are  due  to  the 
[L/5]  term  in  (7). 

Note  that  packet  delay  is  independent  of  the  channel  RTT, 
as  the  retransmission  interval  is  8  rather  than  RTT  as  in  most 
traditional  ARQ  schemes. 

Next,  we  define  a  delay  metric  (DM)  that  captures  the 
differences  in  delay.  Let  Dmm(L)  be  the  minimum  packet 
delay  achieved  with  5  =  0.  This  represents  the  physical  limit 
of  the  channel.  We  define 

DM  =  HmO(L)- D™(L).  (8) 

It  is  defined  for  L  — *  0  so  it  is  independent  of  L.  As  shown 
in  Fig.  2,  when  L  is  small,  the  difference  is  about  the  same 

3The  exact  slope  with  respect  to  ln(L)  is - — — =-. 

ln(l-p+pe-*) 

4 The  knee  of  the  curve  is  at  ^  _p  ^  for  <5  =  0. 


as  at  L  — >  0;  when  L  is  large,  although  the  gap  grows,  it 
grows  logarithmically  at  a  slow  rate.  In  the  particular  example 
of  p  =  0.5  and  8  =  0.5,  the  rate  was  just  0.65  per  decade. 

IV.  Throughput  Metric 

To  capture  the  bandwidth  consumption,  we  compute  the 
throughput  metric,  originally  defined  in  [5]— [7] .  In  this  setup, 
the  throughput  metric  TM  is  the  ratio  between  the  expected 
total  number  of  packets  received  and  the  total  number  of 
packets  that  need  to  be  sent.  As  packet  retransmissions  are 
individually  handled,  this  is  equivalent  to  the  expected  number 
of  times  each  packet  is  received  by  the  receiver.  1/TM  is 
essentially  bandwidth  efficiency.  The  minimum  TM  is  1,  as 
each  packet  must  be  received  at  least  once.  As  there  may  be 
unnecessary  retransmission  due  to  the  long  feedback  delay, 
TM  could  be  much  larger  than  1  leading  to  low  efficiency.  For 
example,  if  each  packet  is  expected  to  be  received  5  times  on 
average,  the  efficiency  is  20%.  The  reason  we  define  TM  based 
on  reception  rather  than  transmission  is  that  the  minimum  TM 
would  be  1  independent  of  channel  blockage,  even  when  the 
channel  is  severely  blocked. 

To  compute  TM,  we  see  that  after  each  packet  is  first 
successfully  transmitted  through  the  channel,  while  waiting 
for  the  ACK,  the  transmitter  will  continue  transmission  for 
RTT/5  —  1  extra  times.  Therefore, 


KTT/S-l 

TM  =  1  +  J2  (p+(l-p)e~iS) 

i= 1 

KTT/S-1 

=  X)  (p + (i  -  p)^'5) 

1=0 


=  p- 


RTT 

5 


+  (1  -  P)  • 


1  -  e~RTT 
1  —  e  * 


(9) 


When  8  =  RTT,  the  scheme  becomes  standard  ARQ,  where 
the  transmitter  waits  for  a  full  RTT  to  ensure  that  the  packet 
is  indeed  lost  before  retransmission,  and  the  minimum  TM  of 
1  is  achieved.  In  (9),  we  see  that  the  first  term  contains  1/5. 
5  When  5  is  small,  TM  would  grow  like  1/5.  Therefore,  very 
small  5  should  be  avoided. 

Fig.  3  shows  the  throughput  metric  as  a  function  of  5  for 
p  =  0.5  and  different  values  of  RTT.  For  each  RTT,  TM 
decreases  with  5.  When  5  =  RTT,  TM  =  1. 


V.  Delay-Throughput  Tradeoff 
In  Section  III,  we  showed  that  packet  delay  suffer  some 
degradation  with  increasing  5.  In  Section  IV,  we  showed  that 
bandwidth  utilization,  i.e.,  the  throughput  metric,  could  be  very 
large  when  5  is  made  too  small.  In  this  section,  we  study 
the  tradeoff  between  the  delay  achievable  and  the  bandwidth 
required. 

Fig.  4  shows  the  DM-TM  tradeoff  for  the  case  of  p  =  0.5 
and  RTT  =  1.  The  curve  shows  the  various  DM/TM  pairs 


5  In  the  second  term,  1  —  e  5  «  <5  when  <5  is  small.  So  effectively,  the 
second  teim  also  contains  1/5. 


p  =0.5 


VI.  Retransmit  Interval  Selection 


Fig.  3.  Throughput  Metric 
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achieved  as  8  takes  the  values  of  0.01,  0.1,  0.2,  0.5,  and  1.0. 
While  the  DM  is  defined  with  L  — >  0,  for  larger  values  of 
L  such  as  100  and  1000,  we  saw  that  the  shape  of  the  curve 
remains  the  same  with  the  delay  axis  scaled  by  a  factor  of 
2.2  and  3.0,  respectively.  The  last  point  with  <5  =  1.0  =  RTT 
achieves  the  minimum  TM  of  1.  The  second  last  point  with 
5  =  0.5  =  p  achieves  a  DM  of  0.71  and  a  TM  of  1.8,  which 
corresponds  to  about  ~  55%  efficiency. 

Another  way  to  get  some  intuition  on  the  delay-throughput 
tradeoff  is  through  looking  at  approximate  expressions  of  DM 
and  TM  instead  of  the  exact  expressions.  For  DM,  we  have 
(see  second  paragraph  below  (7)) 

DM  re  (10) 

2  P  P 

For  TM,  the  worst  that  could  happen  is  that  all  RTT [8  —  1 
retransmissions  are  received,  so 


P- 


RTT 

8 


<  TM  < 


RTT 
8  ' 


(ID 


So  DM  is  proportional  to  <5  and  TM  is  inversely  proportional 
to  8.  While  this  is  not  exact,  it  provides  a  good  intuition. 


In  the  last  section,  we  studied  the  delay-throughput  tradeoff. 
In  this  Section,  we  provide  some  rule-of-thumb  recommenda¬ 
tion  on  how  to  select  the  retransmit  interval  5  for  different 
scenarios. 

1)  8  should  never  be  more  than  RTT.  The  minimum  TM 
of  1  is  achieved  when  8  —  RTT,  increasing  <5  further 
would  only  hurt  delay. 

2)  When  RTT  «  1,  use  S  =  RTT.  This  is  essentially  the 
ideal  selective-repeat  ARQ. 6 

3)  When  RTT  re  1,  for  light  to  moderate  blockage  with 
0.5  <  p  <  1,  use  8  =  p.  This  achieves  DM  between 
0.75  and  1  and  achieves  TM  between  RTT  and  2RTT. 

4)  When  RTT  re  1  and  severe  blockage  with  p  <  0.5,  use 
<5  =  0.5  if  willing  to  suffer  delay  from  0.25 /p  to  0.5/p; 
use  8  =  p  if  willing  to  pay  bandwidth  up  to  RTT/p. 

5)  When  RTT  »  1,  use  time  step  8  between  1  and 
2,  so  the  transmissions  see  nearly  independent  channel 
realizations.  Also,  instead  of  retransmit  every  5,  do 
transmission  only  a  few  times  per  RTT  according  to  the 
multi-burst  transmission  strategy  in  [5] — [7]. 

In  practice,  we  may  not  have  accurate  knowledge  of  the 
channel  parameters  p,  dc,  and  T0.  In  this  case,  we  could 
utilize  the  approximations  of  DM  and  TM  in  (10)  and  (1 1) 
and  select  5  that  leads  to  acceptable  DM  and  TM  given  the 
channel  parameter  ranges. 

Another  general  rule  is  that  when  the  primary  performance 
metric  is  low  delay,  the  retransmission  interval  should  scale 
with  channel  memory;  while  if  the  the  primary  performance 
metric  is  high  throughput  efficiency,  the  retransmission  interval 
should  scale  with  the  channel  RTT.  Indeed,  in  traditional 
ARQ  techniques,  the  primary  performance  metric  is  high 
throughput  efficiency,  and  retransmission  typically  occur  once 
every  RTT.  In  the  scenario  of  interest  in  this  paper,  we  want 
to  support  real-time  applications  with  low  delay  requirement, 
retransmission  interval  is  often  tied  to  the  channel  memory,  so 
we  select  8,  which  is  essentially  how  long  the  retransmission 
interval  is  relative  to  the  channel  memory. 

VII.  User  Perspective  Analysis 

While  this  paper  establish  the  analytical  results,  we  are 
currently  also  working  on  an  emulation  7  where  two  tele¬ 
phones  are  connected  via  a  PC  emulating  the  severely-blocked 
high-delay  long-memoty  channel  and  running  the  proposed 
algorithm.  As  our  initial  testing  has  revealed,  in  addition  to 
packet  delay,  playback  interruptions  also  strongly  impact  the 
user  experience.  These  playback  interruptions  occur  most  often 
early  in  the  transmission.  In  this  section,  we  analyze  how  long 
the  interruptions  are  and  how  frequently  they  occur. 

To  analyze  the  above  quantities,  we  define  a  slightly  dif¬ 
ferent  variant  of  delay.  Let  the  user  perceived  delay,  U ( t )  be 

6  If  one  were  interested  in  reducing  the  number  of  transmissions,  one  could 
potentially  hold  off  retransmissions  until  there  is  an  indication  that  the  channel 
has  opened  up. 

7Developed  by  Dr.  Mehmet  Mustafa. 


the  difference  between  how  much  has  been  generated  and  how 
much  has  been  played  back  at  time  t.  When  t  is  an  integer 
multiple  of  8,  we  have 

Uk  =  U{k8)  =  kS  —  jS,  for  k  =  1, 2,  ■  ■  ■  ,  K,  (12) 

where  j  <  k  is  the  most  recent  packet  played  back  or  0  if 
no  packet  has  been  played  back  at  all.  Values  of  U(t)  for 
t  between  integer  multiples  of  6  can  be  obtained  via  linear 
interpolation,  as  we  assume  packet  generating  and  playback 
both  happen  continuously  at  constant  rate. 

Fig.  5  shows  an  example  of  (/(f).  It  always  start  at  (7(0)  =  0 
and  initially  climbs  to  U(5+dc)  =  8+dc  (orange),  as  nothing 
can  be  played  back  for  at  least  that  long.  If  the  channel  is 
initially  open,  there  would  be  an  initial  smooth  period  (blue); 
if  the  channel  is  initially  blocked,  this  period  would  have 
zero  duration.  After  that,  Each  time  there  is  an  interruption 
in  playback,  (7(f)  would  climb  (red),  and  whenever  there  is 
smooth  playback,  (7(f)  stays  flat  (green).  We  are  interested 
in  the  expected  duration  of  each  interruption  period  and  each 
smooth  period. 

Each  interruption  occurs  when  a  blockage  duration  exceeds 
the  longest  blockage  that  had  occurred  previously,  and  ends 
when  the  channel  opens  up  again.  Since  the  channel  is 
memory  less,  independent  of  how  long  a  blockage  has  been, 
the  expected  remaining  blockage  is  always 

E [interruption  duration]  (13) 

S  _1  S  6 

Pbu{&)  P  l  —  e~5  B „  l  —  e~s 

Similarly,  each  time  the  channel  becomes  open,  the  expected 
open  duration  is  p^-.  Since  there  is  a  chance  of  p  that  the 
channel  is  initially  open,  the  expected  initial  smooth  period  is 

^[initial  smooth  duration]  (14) 

After  the  initial  smooth  period,  the  duration  of  each  smooth 
period  is  the  sum  of  a  number  of  blockage  and  open  periods. 
The  number,  denoted  by  N ,  is  random,  and  depends  on  how 
many  blockages  takes  place  before  there  is  one  that  is  longer 
than  all  the  previous  blockages.  The  probability  that  one 
blockage  lasts  (strictly)  longer  than  Z8,  Z  ■=  1,2,3,---,  is 
Pbb(8)z-  Therefore,  E[N]  =  Pbb{8)~z.  When  N  blockages 
are  required  before  an  interruption  occurs,  the  length  of  the 
smooth  period  is  the  sum  of  N  open  periods,  each  expected 
to  be  long,  N  -  1  blockage  periods  that  are  all  no 


longer  than  Z5,  and  a  final  blockage  period  that  is  exactly  Z8 
long,  during  which  the  playback  buffer  is  depleted.  Therefore, 
when  the  user  perceived  delay  U  is  8  +  dc  +  Z5,  which  is  the 
delay  resulted  from  the  longest  blockage  being  Z8,  Z  >  1, 
the  expected  smooth  duration  is 


^[smooth  duration  at  (7  =  8  +  dc  +  Z8,  Z  =  1,2,3,--- 
=  E  [JV]  ■  E'[one  open  period]  + 

( E[N ]  —  1)  •  E'fonc  blockage  period  ]  it  is  <  Z8)  +  Z8 

\\~p  + 

P 


=-*—(( 
1  —  e~s  V 


=  TV 


P(  1  -  P) 

(1  -  p  +  pe~*)~z  +  p-  1 


0 


1 


(15) 

(16) 


Fig.  6  shows  the  normalized  expected  smooth  duration  as 
a  function  of  the  normalized  excess  delay,  U  —  dc  =  (Z  + 
1)8,  for  p  =  0.8,  0.5,  and  0.2  (different  colors),  and  8  = 
0.01,  0.1,  0.5,  and  1.0  (different  line  types).  The  y-axis  is  on 
log  scale,  which  means  that  as  delay  increases,  the  smooth 
duration,  increases  exponentially.  We  see  that  for  each  p,  the 
better  performance,  longer  smooth  duration,  is  achieved  with 
smaller  8.  Similar  to  what  we  saw  with  the  packet  based  delay, 
5  =  0.1  is  nearly  as  good  as  8  =  0.01,  and  the  performance 
degradation  of  8  =  1.0  is  about  twice  that  of  5  =  0.5.  When 
p  =  0.5,  a  smooth  period  of  100  can  be  achieved  with  excess 
delay  of  about  7.  Since  each  interruption  tends  to  last  about 
1/p  =  2.  it  would  take  about  3  or  4  interruptions  before  we  can 
get  such  long  smooth  playback.  When  p  =  0.8,  the  blockage 
is  relatively  light,  a  smooth  period  of  100  can  be  achieved 
with  excess  delay  of  just  4,  which  also  corresponds  to  3  to  4 
interruptions.  When  p  =  0.2,  the  blockage  is  heavy,  a  delay 
of  15  is  required,  which  is  3  jp.  Generally,  the  rule  of  thumb 
is  that  after  one  interruption,  the  delay  and  smooth  period  are 
about  1  jp  and  10,  after  two  interruptions,  2 jp  and  30,  after 
three  interruptions,  3/p  and  100,  all  in  the  unit  of  channel 
memory  time  To- 

For  practical  implementation,  to  further  improve  user  expe¬ 
rience,  we  could  enforce  a  minimum  smooth  playback  period, 
e.g.,  each  playback  period  must  be  at  least  2  seconds  long. 
This  could  be  done  by  only  resuming  playback  after  the 
playback  buffer  has  a  certain  amount  buffered,  e.g.,  2  seconds 
worth.  Furthermore,  to  help  user  connect  segments  of  playback 
across  interruptions,  each  time  playback  resumes,  we  could  re¬ 
playback  the  tail  of  the  stream  prior  to  the  interruption.  The 
user  would  hear  a  few  words  or  syllables  repeated,  but  that 
would  help  them  bridge  the  gap  across  the  interruptions.  This 
idea  was  used  in  [4], 


VIII.  Simulation  Results 

This  section  shows  simulation  results  to  verify  the  analytical 
results.  The  channel  parameter  is  p  =  0.5,  Tq  —  1  sec,  and 
dc  =  1  sec.  For  this  channel,  the  average  open  and  blockage 
durations  are  Ty  —  Tb  =  2.  Three  values  of  8  are  used, 
0.1  sec  for  achieving  near  minimum  delay  at  the  expense  of 
bandwidth,  and  0.5  seconds  and  1.0  seconds  for  more  practical 


operation.  For  each  setting,  over  10, 000  trials  are  run  so  that 
the  error  on  the  measured  quantities  are  less  than  1%.  The 
results  are  shown  in  Table  I. 

Table  I  shows  that  for  the  throughput  metric,  the  simulation 
and  analytical  results  match  nearly  perfectly.  When  5  —  0.1, 
TM  is  very  large;  when  5  increases  to  0.1,  TM  is  reduced 
significantly;  5  =  1.0  further  reduces  TM,  but  the  gain  is 
diminishing. 

The  second  and  third  rows  show  the  expected  delay  for  short 
L  =  1  sec  and  long  L  =  100  sec  messages.  The  simulation 
results  are  slightly  less  than  the  analytical  results,  because  the 
union  bound  used  in  calculating  the  analytical  results  leads  to 
over-estimation.  Delay  increases  with  <5  as  expected. 

The  fourth  row  shows  the  expected  interruption  duration. 
Each  duration  only  lasts  2  to  3  seconds.  This  would  certainly 
be  noticeable  by  the  users.  However,  as  shown  in  (13),  this  is 
essentially  dominated  by  Tb,  which  is  the  physically  limit  of 
the  channel.  Using  5  =  0.5  and  1.0  causes  the  interruption  to 
be  about  30%  and  60%  longer,  respectively. 

The  last  two  rows  shows  the  expected  smooth  duration 
when  the  user  perceived  delay  U  is  2  seconds  and  4  seconds 
above  the  minimum  delay  due  to  channel  propagation.  As  5 
increases,  the  expected  smooth  duration  decreases  slightly.  But 
nominally,  after  having  experienced  2  seconds  of  interruption, 
the  user  can  expect  10  seconds  of  smooth  playback;  after  4 
seconds  of  interruption,  the  expected  smooth  duration  is  nearly 
half  a  minute. 


made  suggestions  on  how  to  pick  the  retransmission  interval 
5  to  achieve  reasonably  good  delay  while  not  consuming  too 
much  bandwidth.  The  general  rule  is  that  when  RTT  «  To, 
simply  use  standard  ARQ  with  5  =  RTT,  wait  a  full  RTT 
before  retransmission;  when  RTT  »  T0,  choose  time  step 
S  =  2To,  so  that  the  channel  uses  are  essentially  experience 
IID  blockage,  and  use  multi-burst  transmission  in  [5]-[7]; 
when  RTT  rs  To,  using  <5  =  p  ■  To  achieves  delay  that  is 
only  about  one  channel  memory  duration  above  the  physical 
limit  for  short  streams  and  TM  of  up  to  RTT/p. 
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IX.  Summary 

We  studied  the  problem  of  real-time  streaming  over  block¬ 
age  channel  with  long  feedback  delay  and  long  channel 
memory.  We  showed  that  most  time  quantities  scales  with  the 
channel  memory  duration  To,  including  the  desired  time  step 
size  <5,  the  packet  delay,  as  well  as  the  interruption  durations 
and  smooth  playback  durations.  We  showed  that  packet  delays 
does  not  change  much  with  stream  length  when  it  is  shorter 
than  channel  memory  duration  and  grows  logarithmically  with 
stream  length  when  it  is  large.  We  showed  that  we  can  expect 
reasonably  long  smooth  playback  periods  after  suffering  a  few 
interruptions.  We  evaluated  the  delay-throughput  tradeoff  and 


TABLE  I 

Compare  simulation  and  analytical  results  for  the  case  p  =  0.5,  T0  1  sec,  and  dc  -  1  sec 


